Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
2.
Front Bioinform ; 3: 1101505, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37502697

RESUMEN

Introduction: Investigation of molecular mechanisms of human disorders, especially rare diseases, require exploration of various knowledge repositories for building precise hypotheses and complex data interpretation. Recently, increasingly more resources offer diagrammatic representation of such mechanisms, including disease-dedicated schematics in pathway databases and disease maps. However, collection of knowledge across them is challenging, especially for research projects with limited manpower. Methods: In this article we present an automated workflow for construction of maps of molecular mechanisms for rare diseases. The workflow requires a standardized definition of a disease using Orphanet or HPO identifiers to collect relevant genes and variants, and to assemble a functional, visual repository of related mechanisms, including data overlays. The diagrams composing the final map are unified to a common systems biology format from CellDesigner SBML, GPML and SBML+layout+render. The constructed resource contains disease-relevant genes and variants as data overlays for immediate visual exploration, including embedded genetic variant browser and protein structure viewer. Results: We demonstrate the functionality of our workflow on two examples of rare diseases: Kawasaki disease and retinitis pigmentosa. Two maps are constructed based on their corresponding identifiers. Moreover, for the retinitis pigmentosa use-case, we include a list of differentially expressed genes to demonstrate how to tailor the workflow using omics datasets. Discussion: In summary, our work allows for an ad-hoc construction of molecular diagrams combined from different sources, preserving their layout and graphical style, but integrating them into a single resource. This allows to reduce time consuming tasks of prototyping of a molecular disease map, enabling visual exploration, hypothesis building, data visualization and further refinement. The code of the workflow is open and accessible at https://gitlab.lcsb.uni.lu/minerva/automap/.

3.
Front Neurol ; 14: 1330321, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-38174101

RESUMEN

Background: Deep phenotyping of Parkinson's disease (PD) is essential to investigate this fastest-growing neurodegenerative disorder. Since 2015, over 800 individuals with PD and atypical parkinsonism along with more than 800 control subjects have been recruited in the frame of the observational, monocentric, nation-wide, longitudinal-prospective Luxembourg Parkinson's study. Objective: To profile the baseline dataset and to explore risk factors, comorbidities and clinical profiles associated with PD, atypical parkinsonism and controls. Methods: Epidemiological and clinical characteristics of all 1,648 participants divided in disease and control groups were investigated. Then, a cross-sectional group comparison was performed between the three largest groups: PD, progressive supranuclear palsy (PSP) and controls. Subsequently, multiple linear and logistic regression models were fitted adjusting for confounders. Results: The mean (SD) age at onset (AAO) of PD was 62.3 (11.8) years with 15% early onset (AAO < 50 years), mean disease duration 4.90 (5.16) years, male sex 66.5% and mean MDS-UPDRS III 35.2 (16.3). For PSP, the respective values were: 67.6 (8.2) years, all PSP with AAO > 50 years, 2.80 (2.62) years, 62.7% and 53.3 (19.5). The highest frequency of hyposmia was detected in PD followed by PSP and controls (72.9%; 53.2%; 14.7%), challenging the use of hyposmia as discriminating feature in PD vs. PSP. Alcohol abstinence was significantly higher in PD than controls (17.6 vs. 12.9%, p = 0.003). Conclusion: Luxembourg Parkinson's study constitutes a valuable resource to strengthen the understanding of complex traits in the aforementioned neurodegenerative disorders. It corroborated several previously observed clinical profiles, and provided insight on frequency of hyposmia in PSP and dietary habits, such as alcohol abstinence in PD.Clinical trial registration: clinicaltrials.gov, NCT05266872.

4.
Front Immunol ; 13: 1002629, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36439150

RESUMEN

Immune mediated inflammatory diseases (IMIDs) are a heterogeneous group of debilitating, multifactorial and unrelated conditions featured by a dysregulated immune response leading to destructive chronic inflammation. The immune dysregulation can affect various organ systems: gut (e.g., inflammatory bowel disease), joints (e.g., rheumatoid arthritis), skin (e.g., psoriasis, atopic dermatitis), resulting in significant morbidity, reduced quality of life, increased risk for comorbidities, and premature death. As there are no reliable disease progression and therapy response biomarkers currently available, it is very hard to predict how the disease will develop and which treatments will be effective in a given patient. In addition, a considerable proportion of patients do not respond sufficiently to the treatment. ImmUniverse is a large collaborative consortium of 27 partners funded by the Innovative Medicine Initiative (IMI), which is sponsored by the European Union (Horizon 2020) and in-kind contributions of participating pharmaceutical companies within the European Federation of Pharmaceutical Industries and Associations (EFPIA). ImmUniverse aims to advance our understanding of the molecular mechanisms underlying two immune-mediated diseases, ulcerative colitis (UC) and atopic dermatitis (AD), by pursuing an integrative multi-omics approach. As a consequence of the heterogeneity among IMIDs patients, a comprehensive, evidence-based identification of novel biomarkers is necessary to enable appropriate patient stratification that would account for the inter-individual differences in disease severity, drug efficacy, side effects or prognosis. This would guide clinicians in the management of patients and represent a major step towards personalized medicine. ImmUniverse will combine the existing and novel advanced technologies, including multi-omics, to characterize both the tissue microenvironment and blood. This comprehensive, systems biology-oriented approach will allow for identification and validation of tissue and circulating biomarker signatures as well as mechanistic principles, which will provide information about disease severity and future disease progression. This truly makes the ImmUniverse Consortium an unparalleled approach.


Asunto(s)
Dermatitis Atópica , Medicina de Precisión , Humanos , Calidad de Vida , Biomarcadores , Progresión de la Enfermedad
5.
Bioinformatics ; 38(4): 1171-1172, 2022 01 27.
Artículo en Inglés | MEDLINE | ID: mdl-34791064

RESUMEN

SUMMARY: COBREXA.jl is a Julia package for scalable, high-performance constraint-based reconstruction and analysis of very large-scale biological models. Its primary purpose is to facilitate the integration of modern high performance computing environments with the processing and analysis of large-scale metabolic models of challenging complexity. We report the architecture of the package, and demonstrate how the design promotes analysis scalability on several use-cases with multi-organism community models. AVAILABILITY AND IMPLEMENTATION: https://doi.org/10.17881/ZKCR-BT30. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Metodologías Computacionales , Programas Informáticos , Modelos Biológicos
6.
Drug Discov Today ; 26(3): 626-630, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33338655

RESUMEN

Translational research today is data-intensive and requires multi-stakeholder collaborations to generate and pool data together for integrated analysis. This leads to the challenge of harmonization of data from different sources with different formats and standards, which is often overlooked during project planning and thus becomes a bottleneck of the research progress. We report on our experience and lessons learnt about data curation for translational research garnered over the course of the European Translational Research Infrastructure & Knowledge management Services (eTRIKS) program (https://www.etriks.org), a unique, 5-year, cross-organizational, cross-cultural collaboration project funded by the Innovative Medicines Initiative of the EU. Here, we discuss the obstacles and suggest what steps are needed for effective data curation in translational research, especially for projects involving multiple organizations from academia and industry.


Asunto(s)
Conducta Cooperativa , Curaduría de Datos , Investigación Biomédica Traslacional/organización & administración , Comparación Transcultural , Humanos
7.
Gigascience ; 9(11)2020 11 18.
Artículo en Inglés | MEDLINE | ID: mdl-33205814

RESUMEN

BACKGROUND: The amount of data generated in large clinical and phenotyping studies that use single-cell cytometry is constantly growing. Recent technological advances allow the easy generation of data with hundreds of millions of single-cell data points with >40 parameters, originating from thousands of individual samples. The analysis of that amount of high-dimensional data becomes demanding in both hardware and software of high-performance computational resources. Current software tools often do not scale to the datasets of such size; users are thus forced to downsample the data to bearable sizes, in turn losing accuracy and ability to detect many underlying complex phenomena. RESULTS: We present GigaSOM.jl, a fast and scalable implementation of clustering and dimensionality reduction for flow and mass cytometry data. The implementation of GigaSOM.jl in the high-level and high-performance programming language Julia makes it accessible to the scientific community and allows for efficient handling and processing of datasets with billions of data points using distributed computing infrastructures. We describe the design of GigaSOM.jl, measure its performance and horizontal scaling capability, and showcase the functionality on a large dataset from a recent study. CONCLUSIONS: GigaSOM.jl facilitates the use of commonly available high-performance computing resources to process the largest available datasets within minutes, while producing results of the same quality as the current state-of-art software. Measurements indicate that the performance scales to much larger datasets. The example use on the data from a massive mouse phenotyping effort confirms the applicability of GigaSOM.jl to huge-scale studies.


Asunto(s)
Algoritmos , Lenguajes de Programación , Animales , Análisis por Conglomerados , Ratones , Programas Informáticos
8.
Front Neurol ; 11: 524, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-32655481

RESUMEN

Over the past two decades, our understanding of Parkinson's disease (PD) has been gleaned from the discoveries made in familial and/or sporadic forms of PD in the Caucasian population. The transferability and the clinical utility of genetic discoveries to other ethnically diverse populations are unknown. The Indian population has been under-represented in PD research. The Genetic Architecture of PD in India (GAP-India) project aims to develop one of the largest clinical/genomic bio-bank for PD in India. Specifically, GAP-India project aims to: (1) develop a pan-Indian deeply phenotyped clinical repository of Indian PD patients; (2) perform whole-genome sequencing in 500 PD samples to catalog Indian genetic variability and to develop an Indian PD map for the scientific community; (3) perform a genome-wide association study to identify novel loci for PD and (4) develop a user-friendly web-portal to disseminate results for the scientific community. Our "hub-spoke" model follows an integrative approach to develop a pan-Indian outreach to develop a comprehensive cohort for PD research in India. The alignment of standard operating procedures for recruiting patients and collecting biospecimens with international standards ensures harmonization of data/bio-specimen collection at the beginning and also ensures stringent quality control parameters for sample processing. Data sharing and protection policies follow the guidelines established by local and national authorities.We are currently in the recruitment phase targeting recruitment of 10,200 PD patients and 10,200 healthy volunteers by the end of 2020. GAP-India project after its completion will fill a critical gap that exists in PD research and will contribute a comprehensive genetic catalog of the Indian PD population to identify novel targets for PD.

9.
Front Aging Neurosci ; 10: 326, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30420802

RESUMEN

While genetic advances have successfully defined part of the complexity in Parkinson's disease (PD), the clinical characterization of phenotypes remains challenging. Therapeutic trials and cohort studies typically include patients with earlier disease stages and exclude comorbidities, thus ignoring a substantial part of the real-world PD population. To account for these limitations, we implemented the Luxembourg PD study as a comprehensive clinical, molecular and device-based approach including patients with typical PD and atypical parkinsonism, irrespective of their disease stage, age, comorbidities, or linguistic background. To provide a large, longitudinally followed, and deeply phenotyped set of patients and controls for clinical and fundamental research on PD, we implemented an open-source digital platform that can be harmonized with international PD cohort studies. Our interests also reflect Luxembourg-specific areas of PD research, including vision, gait, and cognition. This effort is flanked by comprehensive biosampling efforts assuring high quality and sustained availability of body liquids and tissue biopsies. We provide evidence for the feasibility of such a cohort program with deep phenotyping and high quality biosampling on parkinsonism in an environment with structural specificities and alert the international research community to our willingness to collaborate with other centers. The combination of advanced clinical phenotyping approaches including device-based assessment will create a comprehensive assessment of the disease and its variants, its interaction with comorbidities and its progression. We envision the Luxembourg Parkinson's study as an important research platform for defining early diagnosis and progression markers that translate into stratified treatment approaches.

10.
Nucleic Acids Res ; 45(20): 11495-11514, 2017 Nov 16.
Artículo en Inglés | MEDLINE | ID: mdl-29059321

RESUMEN

The post-genomic era has provided researchers with a deluge of protein sequences. However, a significant fraction of the proteins encoded by sequenced genomes remains without an identified function. Here, we aim at determining how many enzymes of uncertain or unknown function are still present in the Saccharomyces cerevisiae and human proteomes. Using information available in the Swiss-Prot, BRENDA and KEGG databases in combination with a Hidden Markov Model-based method, we estimate that >600 yeast and 2000 human proteins (>30% of their proteins of unknown function) are enzymes whose precise function(s) remain(s) to be determined. This illustrates the impressive scale of the 'unknown enzyme problem'. We extensively review classical biochemical as well as more recent systematic experimental and computational approaches that can be used to support enzyme function discovery research. Finally, we discuss the possible roles of the elusive catalysts in light of recent developments in the fields of enzymology and metabolism as well as the significance of the unknown enzyme problem in the context of metabolic modeling, metabolic engineering and rare disease research.


Asunto(s)
Biocatálisis , Genoma Fúngico/genética , Genoma Humano/genética , Metaboloma/genética , Saccharomyces cerevisiae/enzimología , Secuencia de Bases , Mapeo Cromosómico , Bases de Datos Genéticas , Bases de Datos de Proteínas , Enzimas/análisis , Enzimas/genética , Humanos , Metabolómica/métodos , Proteoma/genética , Sitios de Carácter Cuantitativo , Saccharomyces cerevisiae/genética
11.
Bioinformatics ; 33(12): 1852-1858, 2017 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-28200120

RESUMEN

MOTIVATION: The extraction of sequence variants from the literature remains an important task. Existing methods primarily target standard (ST) mutation mentions (e.g. 'E6V'), leaving relevant mentions natural language (NL) largely untapped (e.g. 'glutamic acid was substituted by valine at residue 6'). RESULTS: We introduced three new corpora suggesting named-entity recognition (NER) to be more challenging than anticipated: 28-77% of all articles contained mentions only available in NL. Our new method nala captured NL and ST by combining conditional random fields with word embedding features learned unsupervised from the entire PubMed. In our hands, nala substantially outperformed the state-of-the-art. For instance, we compared all unique mentions in new discoveries correctly detected by any of three methods (SETH, tmVar, or nala ). Neither SETH nor tmVar discovered anything missed by nala , while nala uniquely tagged 33% mentions. For NL mentions the corresponding value shot up to 100% nala -only. AVAILABILITY AND IMPLEMENTATION: Source code, API and corpora freely available at: http://tagtog.net/-corpora/IDP4+ . CONTACT: nala@rostlab.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Minería de Datos/métodos , Mutación , Procesamiento de Lenguaje Natural , Programas Informáticos , Humanos , PubMed , Aprendizaje Automático no Supervisado
13.
JAKSTAT ; 4(1): e1062596, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26413425

RESUMEN

Aberrant activation of oncogenic kinases is frequently observed in human cancers, but the underlying mechanism and resulting effects on global signaling are incompletely understood. Here, we demonstrate that the oncogenic FIP1L1-PDGFRα kinase exhibits a significantly different signaling pattern compared to its PDGFRα wild type counterpart. Interestingly, the activation of primarily membrane-based signal transduction processes (such as PI3-kinase- and MAP-kinase- pathways) is remarkably shifted toward a prominent activation of STAT factors. This diverging signaling pattern compared to classical PDGF-receptor signaling is partially coupled to the aberrant cytoplasmic localization of the oncogene, since membrane targeting of FIP1L1-PDGFRα restores activation of MAPK- and PI3K-pathways. In stark contrast to the classical cytokine-induced STAT activation process, STAT activation by FIP1L1-PDGFRα does neither require Janus kinase activity nor Src kinase activity. Furthermore, we investigated the mechanism of STAT5 activation via FIP1L1-PDGFRα in more detail and found that STAT5 activation does not involve an SH2-domain-mediated binding mechanism. We thus demonstrate that STAT5 activation occurs via a non-canonical activation mechanism in which STAT5 may be subject to a direct phosphorylation by FIP1L1-PDGFRα.

14.
Nat Commun ; 6: 7866, 2015 Jul 22.
Artículo en Inglés | MEDLINE | ID: mdl-26198319

RESUMEN

Cell-to-cell communication across multiple cell types and tissues strictly governs proper functioning of metazoans and extensively relies on interactions between secreted ligands and cell-surface receptors. Herein, we present the first large-scale map of cell-to-cell communication between 144 human primary cell types. We reveal that most cells express tens to hundreds of ligands and receptors to create a highly connected signalling network through multiple ligand-receptor paths. We also observe extensive autocrine signalling with approximately two-thirds of partners possibly interacting on the same cell type. We find that plasma membrane and secreted proteins have the highest cell-type specificity, they are evolutionarily younger than intracellular proteins, and that most receptors had evolved before their ligands. We provide an online tool to interactively query and visualize our networks and demonstrate how this tool can reveal novel cell-to-cell interactions with the prediction that mast cells signal to monoblastic lineages via the CSF1-CSF1R interacting pair.


Asunto(s)
Comunicación Celular , Receptores de Superficie Celular/metabolismo , Animales , Evolución Molecular , Humanos , Ligandos , Programas Informáticos
15.
Cell Commun Signal ; 13: 21, 2015 Mar 31.
Artículo en Inglés | MEDLINE | ID: mdl-25880691

RESUMEN

BACKGROUND: Gastrointestinal stromal tumours (GIST) are mainly characterised by the presence of activating mutations in either of the two receptor tyrosine kinases c-KIT or platelet-derived growth factor receptor-α (PDGFRα). Most mechanistic studies dealing with GIST mutations have focused on c-KIT and far less is known about the signalling characteristics of the mutated PDGFRα proteins. Here, we study the signalling capacities and corresponding transcriptional responses of the different PDGFRα proteins under comparable genomic conditions. RESULTS: We demonstrate that the constitutive signalling via the oncogenic PDGFRα mutants favours a mislocalisation of the receptors and that this modifies the signalling characteristics of the mutated receptors. We show that signalling via the oncogenic PDGFRα mutants is not solely characterised by a constitutive activation of the conventional PDGFRα signalling pathways. In contrast to wild-type PDGFRα signal transduction, the activation of STAT factors (STAT1, STAT3 and STAT5) is an integral part of signalling mediated via mutated PDGF-receptors. Furthermore, this unconventional STAT activation by mutated PDGFRα is already initiated in the endoplasmic reticulum whereas the conventional signalling pathways rather require cell surface expression of the receptor. Finally, we demonstrate that the activation of STAT factors also translates into a biologic response as highlighted by the induction of STAT target genes. CONCLUSION: We show that the overall oncogenic response is the result of different signatures emanating from different cellular compartments. Furthermore, STAT mediated responses are an integral part of mutated PDGFRα signalling.


Asunto(s)
Neoplasias Gastrointestinales/metabolismo , Mutación , Proteínas de Neoplasias/metabolismo , Receptor alfa de Factor de Crecimiento Derivado de Plaquetas/metabolismo , Factores de Transcripción STAT/metabolismo , Transducción de Señal , Línea Celular Tumoral , Retículo Endoplásmico/genética , Retículo Endoplásmico/metabolismo , Retículo Endoplásmico/patología , Activación Enzimática/genética , Neoplasias Gastrointestinales/genética , Neoplasias Gastrointestinales/patología , Humanos , Proteínas de Neoplasias/genética , Receptor alfa de Factor de Crecimiento Derivado de Plaquetas/genética , Factores de Transcripción STAT/genética
16.
Nat Chem Biol ; 11(5): 347-354, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25848931

RESUMEN

Huntington's disease (HD) is a currently incurable neurodegenerative condition caused by an abnormally expanded polyglutamine tract in huntingtin (HTT). We identified new modifiers of mutant HTT toxicity by performing a large-scale 'druggable genome' siRNA screen in human cultured cells, followed by hit validation in Drosophila. We focused on glutaminyl cyclase (QPCT), which had one of the strongest effects on mutant HTT-induced toxicity and aggregation in the cell-based siRNA screen and also rescued these phenotypes in Drosophila. We found that QPCT inhibition induced the levels of the molecular chaperone αB-crystallin and reduced the aggregation of diverse proteins. We generated new QPCT inhibitors using in silico methods followed by in vitro screening, which rescued the HD-related phenotypes in cell, Drosophila and zebrafish HD models. Our data reveal a new HD druggable target affecting mutant HTT aggregation and provide proof of principle for a discovery pipeline from druggable genome screen to drug development.


Asunto(s)
Aminoaciltransferasas/efectos de los fármacos , Aminoaciltransferasas/genética , Enfermedad de Huntington/tratamiento farmacológico , Enfermedad de Huntington/genética , ARN Interferente Pequeño , Aminoaciltransferasas/antagonistas & inhibidores , Animales , Células Cultivadas , Biología Computacional , Drosophila , Evaluación Preclínica de Medicamentos , Inhibidores Enzimáticos/farmacología , Inhibidores Enzimáticos/uso terapéutico , Proteínas Fluorescentes Verdes/metabolismo , Humanos , Proteína Huntingtina , Ratones , Ratones Endogámicos C57BL , Mutación/genética , Proteínas del Tejido Nervioso/genética , Proteínas del Tejido Nervioso/metabolismo , Neuronas/efectos de los fármacos , Neuronas/metabolismo , Pez Cebra , Cadena B de alfa-Cristalina/metabolismo
18.
NPJ Biofilms Microbiomes ; 1: 15007, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-28721231

RESUMEN

BACKGROUND: Mixed microbial communities underpin important biotechnological processes such as biological wastewater treatment (BWWT). A detailed knowledge of community structure and function relationships is essential for ultimately driving these systems towards desired outcomes, e.g., the enrichment in organisms capable of accumulating valuable resources during BWWT. METHODS: A comparative integrated omic analysis including metagenomics, metatranscriptomics and metaproteomics was carried out to elucidate functional differences between seasonally distinct oleaginous mixed microbial communities (OMMCs) sampled from an anoxic BWWT tank. A computational framework for the reconstruction of community-wide metabolic networks from multi-omic data was developed. These provide an overview of the functional capabilities by incorporating gene copy, transcript and protein abundances. To identify functional genes, which have a disproportionately important role in community function, we define a high relative gene expression and a high betweenness centrality relative to node degree as gene-centric and network topological features, respectively. RESULTS: Genes exhibiting high expression relative to gene copy abundance include genes involved in glycerolipid metabolism, particularly triacylglycerol lipase, encoded by known lipid accumulating populations, e.g., CandidatusMicrothrix parvicella. Genes with a high relative gene expression and topologically important positions in the network include genes involved in nitrogen metabolism and fatty acid biosynthesis, encoded by Nitrosomonas spp. and Rhodococcus spp. Such genes may be regarded as 'keystone genes' as they are likely to be encoded by keystone species. CONCLUSION: The linking of key functionalities to community members through integrated omics opens up exciting possibilities for devising prediction and control strategies for microbial communities in the future.

19.
PLoS Comput Biol ; 10(12): e1003951, 2014 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-25474213

RESUMEN

Huge research effort has been invested over many years to determine the phenotypes of natural or artificial mutations in HIV proteins--interpretation of mutation phenotypes is an invaluable source of new knowledge. The results of this research effort are recorded in the scientific literature, but it is difficult for virologists to rapidly find it. Manually locating data on phenotypic variation within the approximately 270,000 available HIV-related research articles, or the further 1,500 articles that are published each month is a daunting task. Accordingly, the HIV research community would benefit from a resource cataloguing the available HIV mutation literature. We have applied computational text-mining techniques to parse and map mutagenesis and polymorphism information from the HIV literature, have enriched the data with ancillary information and have developed a public, web-based interface through which it can be intuitively explored: the HIV mutation browser. The current release of the HIV mutation browser describes the phenotypes of 7,608 unique mutations at 2,520 sites in the HIV proteome, resulting from the analysis of 120,899 papers. The mutation information for each protein is organised in a residue-centric manner and each residue is linked to the relevant experimental literature. The importance of HIV as a global health burden advocates extensive effort to maximise the efficiency of HIV research. The HIV mutation browser provides a valuable new resource for the research community. The HIV mutation browser is available at: http://hivmut.org.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Infecciones por VIH/virología , VIH-1/genética , Mutación/genética , Secuencia de Aminoácidos , Humanos , Datos de Secuencia Molecular
20.
Bioinformatics ; 30(22): 3249-56, 2014 Nov 15.
Artículo en Inglés | MEDLINE | ID: mdl-25100685

RESUMEN

SUMMARY: The iterative process of finding relevant information in biomedical literature and performing bioinformatics analyses might result in an endless loop for an inexperienced user, considering the exponential growth of scientific corpora and the plethora of tools designed to mine PubMed(®) and related biological databases. Herein, we describe BioTextQuest(+), a web-based interactive knowledge exploration platform with significant advances to its predecessor (BioTextQuest), aiming to bridge processes such as bioentity recognition, functional annotation, document clustering and data integration towards literature mining and concept discovery. BioTextQuest(+) enables PubMed and OMIM querying, retrieval of abstracts related to a targeted request and optimal detection of genes, proteins, molecular functions, pathways and biological processes within the retrieved documents. The front-end interface facilitates the browsing of document clustering per subject, the analysis of term co-occurrence, the generation of tag clouds containing highly represented terms per cluster and at-a-glance popup windows with information about relevant genes and proteins. Moreover, to support experimental research, BioTextQuest(+) addresses integration of its primary functionality with biological repositories and software tools able to deliver further bioinformatics services. The Google-like interface extends beyond simple use by offering a range of advanced parameterization for expert users. We demonstrate the functionality of BioTextQuest(+) through several exemplary research scenarios including author disambiguation, functional term enrichment, knowledge acquisition and concept discovery linking major human diseases, such as obesity and ageing. AVAILABILITY: The service is accessible at http://bioinformatics.med.uoc.gr/biotextquest. CONTACT: g.pavlopoulos@gmail.com or georgios.pavlopoulos@esat.kuleuven.be SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Minería de Datos/métodos , Programas Informáticos , Autoria , Análisis por Conglomerados , Enfermedad/genética , Genes , Humanos , Internet , Medical Subject Headings , Proteínas , PubMed , Publicaciones
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...